Search CORE

42 research outputs found

Learning Parse and Translation Decisions From Examples With Rich Context

Author: Hermjakob Ulf
Mooney Raymond J.
Publication venue
Publication date: 01/01/1997
Field of study

We present a knowledge and context-based system for parsing and translating natural language and evaluate it on sentences from the Wall Street Journal. Applying machine learning techniques, the system uses parse action examples acquired under supervision to generate a deterministic shift-reduce parser in the form of a decision structure. It relies heavily on context, as encoded in features which describe the morphological, syntactic, semantic and other aspects of a given parse state.Comment: 8 pages, LaTeX, 3 postscript figures, uses aclap.st

arXiv.org e-Print Archive

CiteSeerX

Crossref

Extracting Biomolecular Interactions Using Semantic Parsing of Biomedical Text

Author: Galstyan Aram
Garg Sahil
Hermjakob Ulf
Marcu Daniel
Publication venue
Publication date: 04/12/2015
Field of study

We advance the state of the art in biomolecular interaction extraction with three contributions: (i) We show that deep, Abstract Meaning Representations (AMR) significantly improve the accuracy of a biomolecular interaction extraction system when compared to a baseline that relies solely on surface- and syntax-based features; (ii) In contrast with previous approaches that infer relations on a sentence-by-sentence basis, we expand our framework to enable consistent predictions over sets of sentences (documents); (iii) We further modify and expand a graph kernel learning framework to enable concurrent exploitation of automatically induced AMR (semantic) and dependency structure (syntactic) representations. Our experiments show that our approach yields interaction extraction systems that are more robust in environments where there is a significant mismatch between training and test conditions.Comment: Appearing in Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Translating with Scarce Resources

Author: and Kenji Yamada
Daniel Marcu
Kenji Yamada
Kevin Knight
Philipp Koehn
Ulf Hermjakob
Ulrich Germann
Yaser Al-onaizan
Publication venue
Publication date: 01/01/2000
Field of study

Current corpus-based machine translation techniques do not work very well when given scarce linguistic resources. To examine the gap between human and machine translators, we created an experiment in which human beings were asked to translate an unknown language into English on the sole basis of a very small bilingual text. Participants performed quite well, and debriefings revealed a number of valuable strategies. We discuss these strategies and apply some of them to a statistical translation system

CiteSeerX

Edinburgh Research Explorer

Abstract Meaning Representation for Sembanking

Author: Banarescu Laura
Bonial Claire
Cai Shu
Georgescu Madalina
Griffitt Kira
Hermjakob Ulf
Knight Kevin
Koehn Philipp
Palmer Martha
Schneider Nathan
Publication venue
Publication date: 01/01/2013
Field of study

We describe Abstract Meaning Representation (AMR), a semantic representation language in which we are writing down the meanings of thousands of English sentences. We hope that a sembank of simple, whole-sentence semantic structures will spur new work in statistical natural language understanding and generation, like the Penn Treebank encouraged work on statistical parsing. This paper gives an overview of AMR and tools associated with it

CiteSeerX

Edinburgh Research Explorer

NERO: a biomedical named-entity (recognition) ontology with a large, annotated corpus reveals meaningful associations through text embedding.

Author: Alachram Halima
Ambite José Luis
Ananiadou Sophia
Beißbarth Tim
Chambers Brendan
Christopoulou Fenia
Evans James A
Galstyan Aram
Gao Xin
Garg Sahil
Hermjakob Ulf
Khomtchouk Bohdan B
King Ross
Li Maolin
Li Yu
Marcu Daniel
Matthew Joel
Pan Weidi
Rzhetsky Andrey
Schoene Annika M
Sheng Emily
Soldatova Larisa
Stevens Robert
Wang Kanix
Wingender Edgar
Publication venue: NPJ Syst Biol Appl
Publication date: 01/01/2021
Field of study

Machine reading (MR) is essential for unlocking valuable knowledge contained in millions of existing biomedical documents. Over the last two decades1,2, the most dramatic advances in MR have followed in the wake of critical corpus development3. Large, well-annotated corpora have been associated with punctuated advances in MR methodology and automated knowledge extraction systems in the same way that ImageNet4 was fundamental for developing machine vision techniques. This study contributes six components to an advanced, named entity analysis tool for biomedicine: (a) a new, Named Entity Recognition Ontology (NERO) developed specifically for describing textual entities in biomedical texts, which accounts for diverse levels of ambiguity, bridging the scientific sublanguages of molecular biology, genetics, biochemistry, and medicine; (b) detailed guidelines for human experts annotating hundreds of named entity classes; (c) pictographs for all named entities, to simplify the burden of annotation for curators; (d) an original, annotated corpus comprising 35,865 sentences, which encapsulate 190,679 named entities and 43,438 events connecting two or more entities; (e) validated, off-the-shelf, named entity recognition (NER) automated extraction, and; (f) embedding models that demonstrate the promise of biomedical associations embedded within this corpus

Goldsmiths Research Online

Directory of Open Access Journals

Chalmers Research

Apollo (Cambridge)

Parsing and question classification for question answering

Author: Ulf Hermjakob
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2001
Field of study

This paper describes machine learning based parsing and question classification for question answering. We demonstrate that for this type of application, parse trees have to be semantically richer and structurally more oriented towards semantics than what most treebanks offer. We empirically show how question parsing dramatically improves when augmenting a semantically enriched Penn treebank training corpus with an additional question treebank

CiteSeerX

Crossref

v

Author: To My Parents Udo
Ulf Hermjakob
Ulf Hermjakob
Ulf Hermjakob Dipl. -inform
Publication venue
Publication date
Field of study

for their loving support and continuous encouragement for a good education Acknowledgments Iamvery thankful for the contributions of my rst advisor, the late Robert F. Simmons, who helped me develop the basic ideas underlying my doctoral research. I also deeply appreciate the dedicated support of my current advisor, Raymond J. Mooney, who guided the completion of my dissertation with many helpful insights. Many thanks for valuable comments also to the othe

CiteSeerX